Crowding—An essential bottleneck for object recognition: A mini-review
نویسنده
چکیده
Crowding, generally defined as the deleterious influence of nearby contours on visual discrimination, is ubiquitous in spatial vision. Crowding impairs the ability to recognize objects in clutter. It has been extensively studied over the last 80 years or so, and much of the renewed interest is the hope that studying crowding may lead to a better understanding of the processes involved in object recognition. Crowding also has important clinical implications for patients with macular degeneration, amblyopia and dyslexia. There is no shortage of theories for crowding-from low-level receptive field models to high-level attention. The current picture is that crowding represents an essential bottleneck for object perception, impairing object perception in peripheral, amblyopic and possibly developing vision. Crowding is neither masking nor surround suppression. We can localize crowding to the cortex, perhaps as early as V1; however, there is a growing consensus for a two-stage model of crowding in which the first stage involves the detection of simple features (perhaps in V1), and a second stage is required for the integration or interpretation of the features as an object beyond V1. There is evidence for top-down effects in crowding, but the role of attention in this process remains unclear. The strong effect of learning in shrinking the spatial extent of crowding places strong constraints on possible models for crowding and for object recognition. The goal of this review is to try to provide a broad, balanced and succinct review that organizes and summarizes the diverse and scattered studies of crowding, and also helps to explain it to the non-specialist. A full understanding of crowding may allow us to understand this bottleneck to object recognition and the rules that govern the integration of features into objects.
منابع مشابه
An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملUncorking the bottleneck of crowding: a fresh look at object recognition
In crowding, the perception of a target deteriorates in the presence of clutter. Crowding is usually explained within the framework of object recognition, where processing proceeds in a hierarchical and feedforward fashion from the analysis of low level features, such as lines and edges, to high level features, such shapes and objects. Here, reviewing work of the last two years, we will show ev...
متن کاملVisual crowding: a fundamental limit on conscious perception and object recognition.
Crowding, the inability to recognize objects in clutter, sets a fundamental limit on conscious visual perception and object recognition throughout most of the visual field. Despite how widespread and essential it is to object recognition, reading and visually guided action, a solid operational definition of what crowding is has only recently become clear. The goal of this review is to provide a...
متن کاملAttentional priming releases crowding.
Views of natural scenes unfold over time, and objects of interest that were present a moment ago tend to remain present. While visual crowding places a fundamental limit on object recognition in cluttered scenes, most studies of crowding have suffered from the limitation that they typically involved static scenes. The role of temporal continuity in crowding has therefore been unaddressed. We in...
متن کاملPerceived Positions Determine Crowding
Crowding is a fundamental bottleneck in object recognition. In crowding, an object in the periphery becomes unrecognizable when surrounded by clutter or distractor objects. Crowding depends on the positions of target and distractors, both their eccentricity and their relative spacing. In all previous studies, position has been expressed in terms of retinal position. However, in a number of situ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Vision Research
دوره 48 شماره
صفحات -
تاریخ انتشار 2008